A revival of integrity constraints for data cleaning
نویسندگان
چکیده
Integrity constraints, a.k.a. data dependencies, are being widely used for improving the quality of schema. Recently constraints have enjoyed a revival for improving the quality of data. The tutorial aims to provide an overview of recent advances in constraint-based data cleaning.
منابع مشابه
Rank-based strategies for cleaning inconsistent spatial databases
A spatial dataset is consistent if it satisfies a set of integrity constraints. Although consistency is a desirable property of databases, enforcing the satisfaction of integrity constraints might not be always feasible. In such cases the presence of inconsistent data may have a negative effect on the results of data analysis and processing and, in consequence, there is an important need for da...
متن کاملA Interaction between Record Matching and Data Repairing
Central to a data cleaning system are record matching and data repairing. Matching aims to identify tuples that refer to the same real-world object, and repairing is to make a database consistent by fixing errors in the data by using integrity constraints. These are typically treated as separate processes in current data cleaning systems, based on heuristic solutions. This paper studies a new p...
متن کاملImplementing Query Rewriting for Consistent Query Answering in Databases
For several reasons, databases may be inconsistent with respect to a set of integrity constraints. Those inconsistent states must be somehow resolved in order to be able to use the information stored in them. In some cases, data cleaning could be an approach to get rid of these inconsistencies. However, this may be a complex and nondeterministic process that may lead to the loss of potentially ...
متن کاملCleaning trajectory data of RFID-monitored objects through conditioning under integrity constraints
A probabilistic framework is introduced for reducing the inherent uncertainty of trajectory data collected for RFID-monitored objects. The framework represents the position of an object at each instant as a random variable over the set of possible locations. The probability density function of this random variable is initialized according to an a-priori probability distribution, and then revise...
متن کاملProvenance Analysis for Missing Answers and Integrity Repairs
Data provenance approaches track how the answer to a database query derive from input items; however, prior approaches used “positive” provenance and were not directly usable for explaining “expected” but missing answers. A similar problem arises with the failure of integrity constraints. Our perspective is to offer explanations via possible (minimal) repairs using provenance. This is useful fo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- PVLDB
دوره 1 شماره
صفحات -
تاریخ انتشار 2008